IN THE CLAIMS:

Please cancel Claims 17-21 without prejudice to or disclaimer of the subject 
matter contained therein. 

Please amend Claims 1-3, 6-10, 13-16, 22, and 23 and add new Claims 24-26
as follows.

1. (Currently Amended) A speech signal processing apparatus comprising:
[[speech segment search means for searching a speech database for speech
segments that satisfy a phonetic environment;]]

HMM learning means for computing HMMs of [[phonemes on the basis of a
search result of said speech segment search means]] speech segments with phonetic label in a
speech database;

segment recognition means for performing segment recognition of the speech
segments in the speech database on the basis of the HMMs [[of the phonemes]]; and

registration [[segment]] means for [[determining a speech segment used in the
computation of the HMMs by said HMM learning means and]] registering [[the]] a speech segment in
a segment dictionary, in a case where the recognition result of the speech segment by said
segment recognition means corresponds to the phonetic label of the speech segment [[in
accordance with a segment recognition result of said segment recognition means]].
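
For illustration only, the registration condition recited in claim 1 can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the applicant's implementation: the `Segment` record, the `build_segment_dictionary` function, and the `recognize` callable (which stands in for HMM learning plus segment recognition) are hypothetical names that do not appear in the application.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List


@dataclass(frozen=True)
class Segment:
    """Hypothetical record for one speech segment in the speech database."""
    segment_id: str
    label: str  # phonetic label stored with the segment


def build_segment_dictionary(
    segments: Iterable[Segment],
    recognize: Callable[[Segment], str],
) -> Dict[str, List[Segment]]:
    """Register a segment only when the recognition result equals its label.

    `recognize` is an assumed stand-in for recognition with trained HMMs;
    it returns the recognized phonetic label for a segment.
    """
    dictionary: Dict[str, List[Segment]] = {}
    for seg in segments:
        if recognize(seg) == seg.label:
            dictionary.setdefault(seg.label, []).append(seg)
    return dictionary
```

In this sketch a segment whose recognition result differs from its stored label is simply left out of the dictionary.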



2. (Currently Amended) The apparatus according to claim 1, wherein the
phonetic label is a diphone label, and said segment recognition means [[adopts diphones as units of
the phonemes,]] categorizes speech segments into four categories CC, CV, VC, and VV (C: a
consonant, V: a vowel), and performs segment recognition in each category.



3. (Currently Amended) The apparatus according to claim 1, wherein said
registration [[segment]] means comprises:

pattern storage means which [[registers]] has allowable [[speech segment]] phonetic
label patterns, and

said registration segment means checks if a phonetic label of a speech segment
pattern matches with one of the allowable phonetic label patterns even if the phonetic label is not
equal to the recognition result of said segment recognition means [[a speech segment that is not
successfully recognized by said segment recognition means, and registers the speech segment in
the segment dictionary if the corresponding speech segment pattern is found]].

4. (Previously Presented) A speech signal processing apparatus comprising: 
speech segment search means for searching a speech database for speech
segments that satisfy a phonetic environment;

HMM learning means for computing HMMs of phonemes on the basis of a 
search result of said speech segment search means; 

segment recognition means for performing segment recognition of the speech 
segments on the basis of the HMMs of the phonemes; and 
registration segment means for determining and registering a speech segment
in a segment dictionary when the number of speech segments recognized by said segment 
recognition means is not less than a predetermined value. 

5. (Previously Presented) The apparatus according to claim 4, wherein said 
registration segment means registers a speech segment in the segment dictionary if at least a 
vowel part of the speech segment is correctly recognized, even when the number of speech 
segments recognized by said segment recognition means is not more than a predetermined value. 

6. (Currently Amended) The apparatus according to claim 1, wherein said
segment recognition means computes likelihoods of speech segments of an identical [[phoneme]]
phonetic label, and

said registration [[segment]] means registers, in the segment dictionary, speech
segments having maximum likelihoods or having likelihoods not less than a predetermined
value.

7. (Currently Amended) The apparatus according to claim 6, wherein said 
registration [[segment]] means registers, in the segment dictionary, speech segments having upper
values obtained by normalizing the likelihoods by durations of the speech segments or 
likelihoods having the values not less than a predetermined value. 
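
For illustration only, the selection recited in claims 6 and 7 can be sketched in Python as follows. This is a minimal sketch under stated assumptions: the likelihoods are taken to be log-likelihoods, "upper values" is read as the top `top_n` segments per phonetic label, and the names `ScoredSegment`, `select_by_normalized_likelihood`, `top_n`, and `threshold` are hypothetical and do not appear in the application.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, NamedTuple, Optional


class ScoredSegment(NamedTuple):
    """Hypothetical record: a segment with its recognition score."""
    segment_id: str
    label: str             # phonetic label
    log_likelihood: float  # assumed to be a log-likelihood
    duration: float        # frames or seconds; must be greater than zero


def select_by_normalized_likelihood(
    scored: Iterable[ScoredSegment],
    top_n: int = 1,
    threshold: Optional[float] = None,
) -> Dict[str, List[ScoredSegment]]:
    """Pick segments per label by duration-normalized likelihood.

    With `threshold` unset, keep the `top_n` best segments per label;
    otherwise keep every segment whose normalized value is not less
    than `threshold`.
    """
    by_label: Dict[str, List[ScoredSegment]] = defaultdict(list)
    for s in scored:
        by_label[s.label].append(s)

    selected: Dict[str, List[ScoredSegment]] = {}
    for label, group in by_label.items():
        ranked = sorted(
            group, key=lambda s: s.log_likelihood / s.duration, reverse=True
        )
        if threshold is None:
            selected[label] = ranked[:top_n]
        else:
            selected[label] = [
                s for s in ranked if s.log_likelihood / s.duration >= threshold
            ]
    return selected
```

Dividing by duration keeps long segments from being penalized merely for accumulating more per-frame terms, so the ranking can differ from a ranking on raw likelihoods.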

8. (Currently Amended) A speech signal processing method comprising:
[[a speech segment search step of searching a speech database for speech
segments that satisfy a phonetic environment;]]

an HMM learning step of computing HMMs of [[phonemes on the basis of a
search result in said speech segment search step]] speech segments with phonetic label in a speech
database;

a segment recognition step of performing segment recognition of the speech
segments in the speech database on the basis of the HMMs [[of the phonemes]]; and

a registration [[segment]] step of [[determining a speech segment used in the
computation of the HMMs in said HMM learning step and]] registering [[the]] a speech segment in a
segment dictionary, in a case where the recognition result of the speech segment in said segment
recognition step corresponds to the phonetic label of the speech segment [[in accordance with a
segment recognition result in said segment recognition step]].

9. (Currently Amended) The method according to claim 8, wherein the 
phonetic label is a diphone label and said segment recognition step adopts diphones as units of
the phonemes and categorizes speech segments into four categories CC, CV, VC, and VV (C: a 
consonant, V: a vowel), and includes the step of performing segment recognition in each 
category. 

10. (Currently Amended) The method according to claim 8, wherein said
registration [[segment]] step comprises:
a pattern storage step of registering allowable [[speech segment]] phonetic
label patterns, and

[[a]] said registration segment step includes a step of checking [[if a speech segment
pattern matches a speech segment that is not successfully recognized in said segment recognition
step, and registering the speech segment in the segment dictionary if the corresponding speech
segment pattern is found]] whether the phonetic label of a speech segment matches with one of the
allowable phonetic label patterns even if the phonetic label is not equal to the result in said
segment recognition step.

11. (Previously Presented) A speech signal processing method comprising:
a speech segment search step of searching a speech database for speech
segments that satisfy a phonetic environment;

an HMM learning step of computing HMMs of phonemes on the basis of a 
search result in said speech segment search step; 

a segment recognition step of performing segment recognition of the speech 
segments on the basis of the HMMs of the phonemes; and 

a registration segment step of determining and registering a speech segment in 
a segment dictionary when the number of speech segments recognized in said segment 
recognition step is not less than a predetermined value. 

12. (Previously Presented) The method according to claim 11, wherein said 
registration segment step includes a step of registering a speech segment in the segment 
dictionary if at least a vowel part of the speech segment is correctly recognized, even when the
number of speech segments recognized in said segment recognition step is not more than a 
predetermined value. 

13. (Currently Amended) The method according to claim 8, wherein [[the]] said
segment recognition step includes a step of computing likelihoods of speech segments of an
identical [[phoneme]] phonetic label and

said registration segment step includes a step of registering, in the segment 
dictionary, speech segments having maximum likelihoods or having likelihoods not less than a 
predetermined value. 

14. (Currently Amended) The method according to claim 13, wherein said 
registration segment step includes a step of registering, in the segment dictionary, speech 
segments having upper values obtained by normalizing the likelihoods by durations of the speech 
segments or likelihoods having the values not less than a predetermined value. 

15. (Currently Amended) A computer readable storage medium storing a 
program for implementing the method [[recited in]] according to claim 8.

16. (Currently Amended) A speech [[signal processing]] synthesis apparatus
comprising:
speech synthesis means for synthesizing speech using the segment dictionary
made by the speech signal processing apparatus according to claim 1.

[[a segment dictionary in which speech segments are registered by the method
recited in claim 8;]]

[[language analysis means for performing language analysis of input text data;]]

[[prosody generation means for generating prosody on the basis of an analysis
result of said language analysis means;]]

[[speech segment selection means for searching said segment dictionary on the
basis of the prosody generated by said prosody generation means to select corresponding speech
segments;]]

[[speech segment modification/concatenation means for modifying and
concatenating the speech segments selected by said speech segment selection means; and]]

[[speech reproduction means of reproducing speech on the basis of the result
modified by said speech segment modification/concatenation means.]]

17-21. (Cancelled)

22. (Currently Amended) A speech synthesis method comprising:
a speech synthesis step of synthesizing speech using the segment dictionary
made by the speech signal processing method according to claim 8. [[The method according to
claim 20, further comprising a speech synthesis step of producing synthetic speech using the
segment dictionary.]]

23. (Currently Amended) A computer readable program storing a program for
implementing the method [[recited in claim 20]] according to claim 22.



24. (New) A speech signal processing apparatus comprising: 

HMM learning means for computing HMMs of speech segments with phonetic
label in a speech database;

segment recognition means for performing segment recognition of the speech
segments in the speech database on the basis of the HMMs;

judgment means for judging whether the result of the segment recognition
corresponds to the phonetic label of a speech segment; and 

storage means for storing the result of the judgment judged by said judgment 
means associated with the speech segment. 

25. (New) A speech signal processing method comprising: 

an HMM learning step of computing HMMs of speech segments with phonetic
label in a speech database; 

a segment recognition step of performing segment recognition of the speech 
segments in the speech database on the basis of the HMMs; 

a judgment step of judging whether the result of the segment recognition 
corresponds to the phonetic label of a speech segment; and 

a storage step of storing the result of the judgment judged in said judgment step
associated with the speech segment. 



26. (New) A computer readable program storing a program for implementing 
the method according to claim 25. 